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Abstract. The Birch and Swinnerton-Dyer conjecture states that the rank of the 
Mordell-Weil group of an elliptic curve E equals the order of vanishing at the central 
point of the associated L-function L(s,E). Previous investigations have focused on 
bounding how far we must go above the central point to be assured of finding a zero, 
bounding the rank of a fixed curve or on bounding the average rank in a family. Mestre 
|Mes| showed the first zero occurs by 0(1/ log log Ne), where Ne is the conductor 
of E, though we expect the correct scale to study the zeros near the central point is 
the significantly smaller 1/logAT^. We significantly improve on Mestre's result by 
averaging over a one-parameter family of elliptic curves, obtaining non-trivial upper 
and lower bounds for the average number of normalized zeros in intervals on the order 
of 1/ log Ne (which is the expected scale). Our results may be interpreted as providing 
further evidence in support of the Birch and Swinnerton-Dyer conjecture, as well as 
the Katz-Sarnak density conjecture from random matrix theory (as the number of zeros 
predicted by random matrix theory lies between our upper and lower bounds). These 
methods may be applied to additional families of L-functions. 



1. INTRODUCTION 

The goal of this paper is to provide evidence towards the Birch and Swinnerton- 
Dyer conjecture in one-parameter families of elliptic curves. We briefly summarize our 
results, assuming the reader is familiar with the notation and subject. Afterwards we 
review the needed background material from elliptic curves and previous results in §0 
for the convenience of the reader, we state all the conjectures assumed or discussed at 
various points in Appendix lAl We then prove our theorems and discuss generalizations 
to other families of L-functions in §[3l where we give explicit non-trivial upper and 
lower bounds. 

The Birch and Swinnerton Dyer conjecture asserts that if E is an elliptic curve whose 
Mordell-Weil group E{Q) has geometric rank r, then the associated completed L- 
function A(s, E) has analytic rank r (i.e., it vanishes to order r at the central point). 
This is an exceptionally hard problem to investigate, theoretically and numerically. 
While there is some theoretical evidence when the rank is at most 1, the general case is 
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intractable both theoretically and experimentally. For example, although we can con- 
struct elliptic curves with geometric rank exceeding 20, the largest known lower bound 
for the analytic rank of a A (s, E) is only 30 

We consider the following natural question. Let E be an elliptic curve with geo- 
metric rank r, and assume the Generalized Riemann Hypothesis (GRH). The Birch and 
Swinnerton-Dyer conjecture predicts that there should be r zeros at the central point. 
How far must we go along the critical line before we are assured of seeing r zeros? 

If Ne denotes the conductor of the elliptic curve, we expect the correct scale for 
zeros near the central point to be of size l/logA^. Miller UMiBII investigated the 
first few zeros above the central point for the family of all elliptic curves as well as 
one-parameter families of small rank over Q(T). His results are consistent with the 
low zeros being of height on the order of 1/ log A^; however, the first few zeros are 
higher than the Ne — > oo scaling limits predicted by the independent model of random 
matrix theory. The data suggests that, for finite conductors, better agreement is obtained 
by modeling these zeros with the interaction model (which involves Jacobi ensembles). 
Determining the correct corresponding random matrix ensemble involves understanding 
the discretization of the central values of L-functions and the lower order terms in the 
1 -level density. In his thesis Due Khiem Huyn [ |Huy[ successfully modeled the first 
zero of the family of quadratic twists of a fixed elliptic curve, and current work by the 
second named author and Eduardo Duenez, Due Khiem Huynh, Jon Keating and Nina 
Snaith is investigating the case of a general one-parameter family [DHKMS |. 

The best theoretical result on the first zero above the central point is due to Mestre. 
Assuming the Generalized Riemann Hypothesis, Mestre HMesI bounded the analytic 
rank of E by 0(log Ne / log log Ne) and showed its first zero above the central point is 
at most Bj log log Ne- While this is significantly larger than what we expect the truth 
to be, namely 0(1/ log N E ), it has the advantage of holding for all elliptic curves. 

In this note we show that we may reduce the window on the critical line to something 
of the expected order if we average over a one-parameter family of elliptic curves. 
Specifically, consider a one-parameter family S : y 2 = x 3 + A(T)x+B(T) of geometric 
rank r over Q(T), with A(T), B(T) £ Z[T). For each t £ Z we may specialize and 
obtain an elliptic curve E t : y 2 = x 3 + A{t)x + B{t) with conductor N t := Ne^ By 
Silverman's specialization theorem [Sil2], for all t sufficiently large each elliptic curve 
Et has geometric rank at least r. Assuming standard conjectures, Helfgott HHel proved 
that for a generic family the sign of the functional equation is 1 half the time and -1 
the other half. It is believed that a generic curve in a generic family has analytic rank 
as small as possible consistent with all constraints. In our case, as the rank must be at 
least r if the Birch and Swinnerton-Dyer conjecture is true, we expect that in the limit 
half the curves will have analytic rank r and the other half r + 1, for an average rank of 
r + - 



The number of terms needed for the computation is on the order of the square-root of the conductor 
of E, which grows rapidly in families. While it is possible to numerically show that the first r Taylor 
coefficients of A(s, E) are close to zero for many E's with geometric rank r, in general these compu- 
tations can only provide evidence. The exception is when we have formulas for the derivatives as a 
known quantity times a rational, in which case we can convert these calculations to proofs of vanishing. 
See http : / / web . math . hr/~du je/tors /rk28 . html for an example by N. Elkies of an elliptic 
curve with geometric rank at least 28. 
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We take our family to be Tr := {A(s, E t ) : R < t < 2R} with R — > oo, though we 
often abuse notation and use to denote t in [R, 2R] . There are two ways to normalize 
the zeros of A(s, E t ) near the central point: (1) globally, using := Y^t&T ' 
(2) locally, using lo | — . It is significantly easier to use the global rescaling; however, as 
each elliptic curve can be considered independent of the family, it is more correct to use 
the local rescaling (in this case, due to the technicalities that arise we must add some 
additional restrictions on which t G [R, 2R] are in the family). 

Before stating our main result, we must first introduce some notation. All conjectures 
are stated in full in Appendix lAl 

Definition 1.1 (Sieved family). Let £ : y 2 = X s + A(T)x + B(T) be a one-parameter 
family of elliptic curves over Q(T) with discriminant A(T), let D{T) be the product of 
the irreducible polynomial factors of the discriminant, and let B be the largest square 
dividing D(t) for all integers t. For a fixed c, to, our family is the set of all t = ct' + to 
(with t G [R, 2R]) such that D(ct' + t ) is square-free except for primes p\B where 
the power of such p\D(t) is independent oft. In HMil21 it is shown that for any one- 
parameter family, there is a choice of c and to such that the number of such t is cgR + 
o(R) for some Cs > if every irreducible polynomial factor of A(T) has degree at most 
3 ( if not, the claim is true if we assume either the ABC or Square-free Sieve Conjecture ). 
We let T' R denote the sieved family. 

Definition 1.2 (Average number of zeros in a family). Let £ : y 2 = x 3 + A[T)x + B(T) 
be a one-parameter family of elliptic curves over Q(T) with specialized curves E t with 
conductors N t . Assume GRH and write the non-trivial zeros of A(s, E t ) as \ + i^tj, 
and set 

\ogN_ ^^logAi 

t=R 

The average number of zeros with imaginary part at most r (in absolute value) under 
the global and local renormalizations are defined to be 

*£33m - i£#{^^[-T,r]} 

ter' R 

The Birch and Swinnerton-Dyer conjecture implies that, for families where half the 
curves have even and half have odd sign, 

7 (global)/ n N 7 (local) , n s > 1 

Our main results are upper and lower bounds for how many normalized zeros there 
are on average in the interval [— r, r), in particular, how small we may take r and be 
assured on average that there are r + | zeros in the interval. 

Theorem 1.3. Let S be a one-parameter family of elliptic curves of geometric rank r 
over Q(T); if £ is not a rational surface (see Remark \A.l\ for a definition) then assume 
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Tate's conjecture. Additionally, if we are using the local renormalization of the zeros 
we must assume either the ABC or the Square-free Sieve Conjecture if the discriminant 
has an irreducible polynomial factor of degree at least 4. 

Let a be chosen such that we can compute the 1-level density (defined in C?.;?D for 
even Schwartz test functions (f> with supp(</>) C (—cr, cr); see Theorem \2.3[ for details on 
what cr are permissible for a given family. 

Then 

• Lower bounds for the average number of normalized zeros in [— r, r]: Let 

the notation be as in Definition 17.21 and assume GRH. Let h be any even, twice 
continuously differentiable function supported on [—1, 1] and monotonically de- 
creasing on [0, 1]. For fixed r > let f(y) = h(2y/cr), g(y) = (f * f)(y) 
(the convolution of f with itself), and let 4>(x) equal the Fourier transform of 
g(y) + (2nr)^ 2 g"(y). Note supp(0) C (—a, a) and (f>(x) is non-negative for 
\x\ < r and non-positive for \x\ > r. Then 

-(global), N (local) , s f l\ 0(0) / log log R \ 



where 

0(0) {^h(ufdu) + {^njlh(u)h"(u)du) 



0(0) a{Jlh{u)duf 



(1.4) 



If we let tbsd(o") denote the value ofr such that we are assured of at least r + | 
zeros on average (as R — > ooj in [— r, r] given that we can compute the 1-level 
density for test functions whose Fourier transform is supported in (—cr, cr), then 




i \ ~V 2 

L h(u) 2 du \ 1 1 

■'" 1 - (1.5) 



^h{u)h"{u)du) cr ' nC(h)cr' 

This should be compared to the predictions from the Birch and Swinnerton- 
Dyer and Parity Conjectures for a generic family, which predict tbsd(c) = 0. 
In particular, taking 

f(l-x 2 )(l- 0.233428x 2 + 0.0189588x 4 ) if\x\<l 
h(x) = < 7V (1.6) 

I otherwise 

yields 

tbsd^) < -7^, (L7) 
7rC (li)cr 

where C{h) ~ 0.63662 (which is approximately 2/n); note l/TrC(h)cr is ap- 
proximately 1 / 2a. In the arguments below we use 2/tt for brevity without re- 
minding the reader that the numerical calculation is only close to the above. 

Upper bounds for the average number of normalized zeros in [— r, r): Let ip be 

a twice continuously differentiable even Schwartz test function with supp(^) C 
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{—a, a), ip(x) > for all x, andip(x) monotonically decreasing on [0,r). Then 

^(global)/ \ ^(local) / x 

^ /V 1\ , (i-+;W(0)-fr)) + #) , / loglogii \ 

- l r+ ij + W) + \m^) <L8) 

If we consider the interval oz) from the lower bound, taking ip(x) = 

{^r) y ields the 

average number of normalized zeros in the limit in this 
interval is at most (r + ~ + ±) /^(l/2a) = ^ (r + ~ + ~J. 
• Random matrix theory prediction. Let £ be a generic one-parameter family 
of elliptic curves of rank r over Q(T) wzY/z half of the specialized functional 
equations even and half odd. Assuming the Katz-Sarnak Density Conjecture, as 
R ^ oo the average number of normalized zeros in [— t, t] is (r + =) + 2r; 
more precisely, random matrix theory predicts 



1 

^ — ave,y- \ ' n ~ &ve j- tt \ ' / ' q 

it— >oo z 



^"i(r),^yr) =r + ^ + 2r. (1.9) 



Zn particular, setting r = ^ yields a prediction of r + | + 2 ■ ^ normalized 
zeros in the limit on average. 



In summary, the number of normalized zeros on average as R — > oo in f/ze interval 

{-h>h) satis fy 

^(global)/ s ^(local) , * - ^ ( ' 



^ + 2 < 3E£2(t), Z£&(t) < + - + (1.10) 

anJ z^/h's interval contains the prediction from Random Matrix Theory, r + | + -. 

Remark 1.4. We obtained our upper bound for tbsd(°") setting (f)(0) / 0(0) = 0. TTze 
important item to note is that Tbsd(c") (or any r) is inversely proportional to the support 
a. In other words, the larger we may take a, the more we may concentrate <fi near the 
central point and thus the smaller the window. Random matrix theory predicts we may 
take o arbitrarily large, which would imply we may take r arbitrarily small and thus 
prove the Birch and Swinnerton-Dyer conjecture on average. 

2. Background material and previous results 

2.1. Elliptic curves. We quickly review the needed background material on elliptic 
curves; the reader familiar with the notation and theory may safely skip this subsection. 
See [KoH ISlTTl ISTll for proofs, as well as the survey Ipfell 

Let E be an elliptic curve over Q, say y 2 = x 3 + ax + b with a, b E Z, and set 

E(Q) := {(x,y) G Q 2 : y 2 = x 3 + ax + b}. (2.1) 



We can define addition of two elements of E(Q) as follows (see Figured])- If -P = 
(xi,yi) and Q = (x 2 ,i/ 2 ) are m E(Q), then the line y = mx + b connecting them 
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Addition of distinct points P and Q Adding a point P to itself 
Figure 1 . The addition law on an elliptic curve. In the second example 
the line is tangent to E at P. 

has rational coordinates @ Substituting this expression for y into the elliptic curve, we 
find (mx + b) 2 = x 3 + ax + b. This is a cubic in x with rational coefficients. By 
construction two of its roots are x\ and x 2 , both rational numbers. Thus the third root, 
say x 3 , must also be rational. Set R(P, Q) = (x 3 , \/xf, + ax 3 + b) and R(P, Q) = 
(x 3 , — + ax 3 + ^ we define addition by P © Q = R(P, Q), then this (plus 



adding a 'point at infinity') turns E(Q) into a finitely generated abelian group. We 
write E(Q) as Z r © T, where T is a torsion grourjj and r is called the geometric rank of 
the curve. 

Given an elliptic curve E as above, we may associate an L-function as follows. As- 
sume y 2 = x 3 + ax + b is a globally minimal Weierstrass equation for E/Q with 
discriminant A = — 16(4a 3 + 27b 2 ) and conductor Ne- Set 

Oe(p) := p — y) G (Z/pZ) 2 : y 2 = x 3 + ax + b mod p}. (2.2) 

Note that the oe(p)'s encode local data, specifically the number of solutions modulo p. 
Hasse proved |oe(p) | < 2y^p, and we define the L-function by 



L(s,£) 



n 

p[A 



i _ 



n 



(2.3) 



we have included the factors of ^fp so that the completed L-function has a functional 
equation from s to 1 — s and not 2 — s: 



A(s,£) := 



AT 



2^ 



r( S + -)L( S ,E) = e^A(l-a,S), (2.4) 



where e# G {1,-1} is the sign of the functional equation. Following the work of 
Wiles HWiL Taylor- Wiles HTW1 and Breuil-Conrad-Diamond-Taylor [sBCDTj, we may 
associate a weight 2 modular form / to any elliptic curve E, where the level of / equals 
the conductor N E of E. We have A(s, /) = A(s, E); in particular, the completed L- 
function converges for all s. We call the order of vanishing of A(s, E) at s = 1/2 the 
analytic rank of E 1 . 



We assume the two points are distinct; if they are the same, the argument below must be slightly 
modified. 

3 Mazur IIMal proved that torsion group is one of the following: Z/iVZ foriVe {l,2,...,10,12}or 
Z/2 x Z/2iVZ for AT e {1,2,3,4}. 
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The Birch and Swinnerton-Dyer conjecture HBS-Dll IBS-D2H states^ that the order 
of vanishing of A(s, E) at the central point s = 1/2 equals the rank of the Mordell- 
Weil group E(Q), or that the analytic rank equals the geometric rank. Sadly, we are 
far from being able to prove this, though the evidence for the conjecture is compelling, 
especially in the case of complex multiplication and rank at most 1 [Bro, CW, GKZ, 
IGZl IKol fl IKol2[ iRul . In addition there is much suggestive numerical evidence for the 
conjecture; for example, for elliptic curves with modest geometric rank r, numerical 
approximations of the first r — 1 Taylor coefficients are consistent with these coefficients 
vanishing. 

2.2. Explicit Formula. One powerful tool for investigating the Birch and Swinnerton- 
Dyer conjecture is the Explicit Formula (see [RS] for a proof for a general L-function, 
or UMilll for the calculation for elliptic curves), which connects the zeros of an L- 
f unctions to the Fourier coefficients. 

Theorem 2.1. Let (f> be an even, twice continuously differentiable test-function whose 
Fourier transform 

/DO 
<P(x)e- 2nixy dx (2.5) 
-oo 

has compact support, and denote the non-trivial zeros of A(s, E) by \ + {under 
the Generalized Riemann Hypothesis, each G R). Then 

4r V 2tt y VK ' VK > ^ p\ogN E V \\ogN E ) 

ri p 

^ a 2 E {p)\ogp 2 / 21ogp ^ f log\ogN E \ 

Using the explicit formula, Mestre proved^ 

Theorem 2.2 (Mestre [MesJ). Assuming the Generalized Riemann Hypothesis: 

(1) The order of vanishing at the central point is 0(\ogNE/ log log N E ). 

(2) There is an absolute constant B such that the first zero above the central point 
occurs before B / log log N E . 

From the functional equation, however, we expect the first zero above the central 
point to be on the order of 1/logA 7 ^, and not 1/ log log N E . Thus Mestre's result is 
significantly larger than what we expect the truth to be; however, it holds for any elliptic 
curve. The situation is very different if instead we consider families of elliptic curves. 
By averaging the explicit formula over the family and exploiting cancelation in the sums 
of the Fourier coefficients a E (p), it is possible to prove (on average) significantly better 
results. 

4 There is a more precise form of the conjecture which relates the leading term in the Taylor expansion 
to the period integral, regulator, Tamagawa numbers and the Tate-Shafarevich group, but this version is 
not needed for our purposes. 

5 Mestre actually proved more, as his results hold for any weight k cuspidal newform, and not just 
elliptic curves (which correspond to weight 2 cuspidal newforms). 
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Numerous studies have been concerned with bounding the average rank in families. 
We list some of the frequently studied families below (note that, for technical reasons, 
often one has to do some sieving and remove some curves in order to make certain 
sums tractable). These results are obtained by averaging the explicit formula over some 
family Tr, where R is a parameter localizing the conductors, and sending R — > oo. 

• The family of all elliptic curves: y 2 = x 3 + Ax + B, and Tr = {(A, B) : \A\ < 
R 2 , \B\ < R 3 } (or something along these lines). 

• One parameter families over Q(T): y 2 = x 3 + A(T)x + B(T), with A(T), B{T) 
G Z[T] and either Tr = {t : R < t < 2R} or a sub-family of this where the 
conductors are given by a polynomial. 

• Quadratic (or higher) twists of a fixed elliptic curve: dy 2 = x 3 + ax + b, with 
Tr = {d : d < R a fundamental discriminant}. 

The current record belongs to M. Young HYo21 , who showed the average rank in the 
family of all elliptic curves is bounded by 25/14 « 1.79; results for one-parameter fam- 
ilies and quadratic twist families are significantly worse. For a sample of the literature, 
see HBMSWl iBrul IBM ICPRWl IDFKl iGaol iGol iGMl M-B[ [KowTl lKow2l IMil lMll2l iRSil 
iRugfl 15351 lYo21 EH (especially the surveys HBMSW1 iKowTl iRuSil ) . 

2.3. The one-level density. For a family Tr of L-f unctions ordered by conductor (with 
R — > oo), the averaged explicit formula is called the one-level density. Specifically, let 
4> be an even Schwartz test-function whose Fourier transform is supported in (— er, a), 
and denote the zeros of L(s, f) by 1/2 + 27^ (under GRH each 7^ e W). Let Nf 
denote the analytic conductor of L(s, f). We define the one-level density by 

1 Hl fer R / v 7 

This statistic has been fruitfully used by many researchers to study the zeros of elliptic 
curves L-functions (as well as other families of L-functions) near the central point. 

Unlike the n-level correlations, which are the same for any cuspidal newform arising 
from an automorphic representation (see [Hejj IMonl IRSII ). the one-level density for a 
family of L-functions depends on the symmetry of the family. Katz and Sarnak HKSll 
IKS2I conjecture that families of L-functions correspond to classical compact groups; 
specifically, the behavior as the conductors tend to infinity of zeros (respectively values) 
of L-functions is well-modeled by the limit as the matrix size tends to infinity of roots 
(respectively values) of characteristic values of random matrices^ They conjecture that 

lim D FR {(j>) = [ (P(x)W Gi ^{x)dx, (2.8) 

R-»oo J 

where G(T) indicates unitary, symplectic or orthogonal (possibly SO(even) or SO(odd)) 
symmetry; this has been observed in numerous families. Note by Parseval's theorem 



6 These conjectures are a natural outgrowth of observed similarities between behavior of L-functions 
and matrix ensembles. While random matrix theory first arose in statistics problems in the early 1900s 
(see for example IIWislD . it blossomed in the 1950s when it was successfully applied to describe the 
energy levels of heavy nuclei. Its connections to number theory were first noticed by Montgomery IMonl 
and Dyson in the 1970s in studies of the pair correlation of zeros of C(s). See |FM| for a survey on the 
development of the subject and some of the connections between the two fields. 
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that 

J <f>(x)W G{ ^(x)dx = j mW G{n {y)dy. (2.9) 

Let I(u) be the characteristic function of [—1, 1]. Katz and Sarnak prove the Fourier 
transforms of the one-level densities of the classical compact groups are 

Wso(even) («) = 5(u) + -I(u) 



W so (u) = 5(u) 
W SO (odd)(u) = S(u)-^I{u) + l 



2 

W VSp (u) = 5{u)- l -I{u) 
W v (u) = 5(u). (2.10) 

For functions whose Fourier Transforms are supported in [—1, 1], the three orthogonal 
densities are indistinguishable, though they are distinguishable from U and Sp. To 
detect differences between the orthogonal groups using the 1-level density, one needs 
to work with functions whose Fourier Transforms are supported beyond [— 1, 1]Q 

For families of elliptic curves with rank, it is useful to consider additional subgroups 
of the classical compact groups above. We consider the iV — > oo scaling limits of 
matrices of the form 

I rr 

9 

where I r ^ r is the r x r identity matrix and g is an iV x iV orthogonal matrix (drawn 
from either the full orthogonal family or one of the split families, namely even or odd). 
These matrices have r forced eigenvalues at 1 (or r eigenangles at 0) for each g; thus as 
we vary g in one of the three families we obtain the same one-level densities as before 
except for an additional factor of r. Explicitly, 

W r . >SO ( even ) (u) = 5(u) + -I{u) + r 

W r] so(u) = 5(u) + ^ + r 

Wr ;SO (odd)(w) = 5(u)--I(u) + l + r. (2.11) 

For our elliptic curve families, we must evaluate the average over Tr or T' R of (|2.6I) . 
Note that almost all of the conductors will be a bounded power of R for t £ [R, 2R\. If 



One can also distinguish between the various orthogonal groups by looking at the 2-level density, 
as these three ensembles have distinct behavior for arbitrarily small support; see for instance [ Mil2l . If 
n > 3, the determinan expansions for the n-level density are hard to work with; in fact, in Gao's thesis 
|Gao | he is able to compute the number theory and random matrix theory results for greater support than 
he can show agreement. In place of the determinant formulas, one can also use expansions from |HM|; 
though these hold for smaller support, they are sometimes easier for comparisons. 
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we rescale each elliptic curve E's zeros by the correct local factor, namely (log N E )/27T, 
we have 

IB 

{Fr] *-± *-? p\ogN E \\ogN E J 



\F r \ <-t p 2 \ogN E \\ogN E J \ \ogR 

The difficulty with this expression is that, as the conductors are varying, we cannot 
easily pass the sum over the family through the test-function to the Fourier coefficients 
a E (p) and a E (p) 2 . By sieving it is possible to surmount these technical details; this is 
the main result in HMil2H . 

If instead we rescale each elliptic curve 75" s zeros by the global factor, namely 

logiV = 1 y logNz 



ter R 



then we find 



\F R \ *-f p\ogN \\ogN J 



Eer R p 

,2 



1 x - ^ a E (p)\ogp 2/^ 21ogp ^ / log log iV \ 



The analysis is significantly easier here, as now we can pass the summation over the 
family past the test-function and exploit cancelation in sums of the Fourier coefficients 
a E (p) and a E (p) 2 . 

We quote the best known results for general one-parameter families. 

Theorem 2.3 (Miller liMiTlllMiT2T ). 
Notation: 

• Let S be a one-parameter family of elliptic curves of geometric rank r over 
Q(T). 

• Let <p be a twice continuously differentiable function^with supp(0) C (— er, a). 

• Consider the sieved family ( see Definition 17.71) . and denote the degree of the 
conductor polynomial by m. 

• Let G denote either SO, SO (even) or SO (odd). 

Assume 



While the theorem was proved under the assumption that cf> is Schwartz, a careful analysis of the 
argument reveals it suffices that <fr be twice differentiable. 
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• If £ is not a rational surface (see Remark \A.l\ for a definition) then assume 
Tate 's conjecture. 

• If the discriminant has an irreducible polynomial factor of degree at least 4, 
assume either the ABC or the Square-free Sieve Conjecture. 

Then 

D X ^\<P) = J i{y)W r , G {y)dy = ( r + l^{0) + l^Q)+^^^\ (2.15) 

provided a < min (1/2, 2/3m); a similar result holds for .D|i° bal (0) (without the as- 
sumptions that 8 satisfies Tate 's hypothesis and without assuming either the ABC or 
Square-free Sieve Conjecture). 

Remark 2.4. We briefly discuss some consequences and generalizations of the above 
theorem. 

• Similar statements hold for quadratic twist families and the family of all elliptic 
curves. 

• The above result provides support that the zeros of one-parameter families of 
rank r over Q(T) are modeled by the scaling limits of orthogonal matrices with 
r independent eigenvalues of I. 

• As supp(0) C (—1,1), the three orthogonal groups have indistinguishable one- 
level densities. We can see which group correctly models our family by studying 
the 2-level density, which requires us to understand the distribution of signs of 
the functional equations in our family. 

3. Proof of Theorem II .31 

3.1. Preliminaries. Before proving Theorem 11.31 we first prove general results for 
the upper and lower bounds in a window of variable size for a general family of L- 
functions. Theorem 1 1.31 then follows immediately from Theorem l3.1[ Theorem 12.31 and 
the constructions of test-functions satisfying the necessary conditions, which are given 
below. 

Theorem 3.1. Let T R denote a family of L-f unctions, and let Z^°]^ r (t), Z^°^ R (r) 
be defined as in Definition U .21 Assume for both normalizations of the zeros that there 
are constants a and b such that 

DrM = am + 60(0) + O ( lo f logR ) , (3.1) 



logR 

whenever <p or ip is a twice continuously differentiable function with Fourier transform 
supported in (—a, a). If '<j>(x) > Ofor \x\ < r and (f>(x) < whenever \x\ > r, and if 
4>(x) is largest when x — 0, then 

7 (global) M 7 (local) ( ) ,0(0) Q ( lgg lgg R \ _ 

while ifijj(x) > Ofor all x and is monotonically decreasing on (0, r), then 

7 ( g iobai) r s 7 (iocai) , , a(^(0)-V>(r)) + 6^(0) / loglog/? \ 

Z^ >R {r), Z^ R (r) < a + _ + U [ miQgR ) ■ (3-3) 
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Proof. We give the proof for the local rescaling; the global case follows analogously. 
As (f>(x) is non-positive for \x\ > r, the contribution to the one-level density from the 
scaled zeros as large or larger than r in absolute value is non-positive; thus if we remove 
these contributions then the one-level density gives the lower bound 



As is maximized at 0, we increase the left-hand side above by replacing 0(7^) with 

0(0); doing so and dividing by 0(0) yields the claimed bound for Z^°°^ r (t). The 
upper bound is proved analogously. □ 

Remark 3.2. These results are of course not of interest unless we are able to construct 
and ip satisfying the conditions in Theorem \3.1\ For one-parameter families of elliptic 
curves of rank r over Q(T), we have a = r + | and 6 = 1. 

Remark 3.3. For test functions whose Fourier transform is supported in (—1,1), all 
known one-level densities of families of L-functions are in the form of Theorem 13.71 
and thus our results are immediately applicable. For some families where the support 
exceeds ( — 1,1) (such as families of cuspidal newforms of square-free level split by sign 
of the functional equation), a little more work is needed as the functional form of the 
one-level density is different^ For ease of exposition in this paper we confine ourselves 
to the (— 1, 1) case. 



3.2. Proof of Theorem 11.31 The main step in the proof of Theorem ll.3l is showing that 
our result is non-vacuous by constructing an d ip with the claimed properties. Our 
construction of is almost surely similar to the construction implicit in Mestre's work 
HMesll ; see also Hughes and Rudnick [HRJ. 

Proof of the Lower Bound in Theorem \1.3\ We give the lower bound for the number of 
zeros in [— r, r] by constructing a good test function 0. As our results depend on the 
support of (which is finite), it is convenient to normalize our test function and express 
everything in terms of h, which we take to be an even, twice continuously differentiable 
function supported on (—1, 1) and monotonically decreasingon [0, 1). For fixed a, r > 
let f(y) = h(2y/a), g(y) = (/ * f){y) (the convolution!!] of / with itself), and let 
4>{x) equal the Fourier transform of g(y) + (2itT)~ 2 g"(y). We must show (i) supp(0) C 
(— <7, a) and (ii) 0(x) is non-negative for \x\ < r and non-positive for |x| > r. 

The proof of (i) follows from standard properties of convolution. Specifically, as 
supp(/) C (—a/2, a/2), we have supp(g) C (—a, cr)E3 As the support of g" is con- 
tained in the support of g and 4>(y) = g(y) + (27rr)~ 2 g"(y), the support of is contained 
in (—a, a) as claimed. 



9 For the family of Dirichlet characters of prime conductor, the 1 -level density is known to be 0(0) for 
support is known up to (—2, 2), and thus is of the desired form. 

10 The convolution is defined by (A * B){x) = f™^ A(t)B(x - t)dt. 

may interpret the relation between / and g as follows. Let X be a random variable with density 
/ supported in (—a/2, a/2). Then g = f * / is the density of X + X, and is supported in (—a, a). 
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For (ii), the Fourier transform of g"(y) is — (2-Ky) 2 'g(y) (the Fourier transform con- 
verts differentiation to multiplication by 2ixix in our normalization). Further g — f * f 
implies g" = f * f". Combining the above, we fmd3 the Fourier transform of 4>{y) = 
g{y) + (2nr)- 2 g"(y) is <j>{x) = g{x) ■ (1 - (x/r) 2 ). 

To complete the proof, we must show 



0(0) {C h(u) 2 du) + h(u)h"(u)du) 



0(0) a(f* h(u)du) 2 

By construction we have 

0(0) g(0) + (2ttt)-V(0) 



(3.5) 



(3.6) 



0(0) g(0) 

Since g is even and monotonically decreasing near the origin (as g has a maximum at 
0), g"(0) < 0. Thus larger values of r should decrease the ratio above, at the cost of 
increasing the size of our window. 

From our construction, as h and / are even we have 

ntr/2 r-cr/2 f 2t\ f 1 



r 1 r j fit\ r 

0(0) = J f(t) 2 dt = 2 J h i^—J dt = a J h(u) 2 du (3.7) 



and 



/Co.) = r' 2 f(t)f(t)dt 



a/2 

2 fflf(t)dt 







a/2 



I h -\h 



a 2 V \ rr 



2t 



ji 



2 i) dt ( since/(t) = /l (?)' r(t) = ^(?)) 



4 Z" 1 

- / h{u)h"{u)du. (3.8) 



a 



As the Fourier transform of a convolution is the product of the Fourier transforms, a 
straightforward calculation yields 

g(0) = f(0)-f(0) = a 2 (j\(u)du^. (3.9) 

Collecting the above equalities, after some elementary algebra we can express the ratio 
0(0) /0(O) in terms of h as 

0(0) = UoHu) 2 du) + (^) 2 Uoh(u)h''(u)du) 
0(0) a ( J,, 1 h(u)du) 2 



As 4> and <f> are even, the Fourier transform of the Fourier transform is the original function <p(x); if 
4> were not even, we would have to replace <fr(x) with <fr(—x). 
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If we set this ratio equal to zero (i.e., if we choose r so that the numerator vanishes) 
then we find3 that on average there are at least r + | normalized zeros in the band 
where 



7rC(7i)cr' irC{h)a' 



1/2 



L h(u) 2 du 



f Q h(u)h"(u)du 



□ 



Proof of the Upper Bound in Theorem \1.3\ The proof is similar to that of the lower bound; 
in particular, once we construct a function ij) with the desired properties then the claim 
follows immediately from straightforward algebra. 

We are thus again reduced to constructing a function with the specified properties. 
For convenience we construct a ip which is not Schwartz, but which is twice differen- 
tiable; a careful analysis of the proof of Theorem [23] shows that this suffices, and thus 
such a ip is sufficient for our purposes. 

Consider the function ip(x) = ( S ™^ CT ) 2 with a compactly supported Fourier trans- 
form given by 

m = (K 1 - f) "y 0,2) 

[0 ]fy$(-<r,<r); 
see Figure [2] for a plot. Away from the origin, the derivative is given by 

,,. . 2sin(cr7ra;) / . , sin(cr7rx)\ 
rj;'(x) = cos(anx) ^ '- . (3.13) 



airx 2 \ otxx 



It is easy to see that the global maximum is at x = and that ip(x) is decreasing 
up to a; = 1/cr, proving the claim for any r < 1/er (though the bound worsens as r 
approaches 1/a as ip(l/a) = 0). □ 



13 In obvious notation, we have J Q h 2 > — (7T(TT cr iticai) 2 Jq hh". We see J Q hh" < 0, and thus 
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Proof of the Random Matrix Theory prediction in Theorem \1.3\ We assume the conjec- 
tures from Random Matrix Theory hold for any even test function, and not just Schwartz 
test functions. We therefore take <p(x) to be the characteristic function of the interval 
[— r, r], which has Fourier transform equal to sm 2 ^^ • 2r. Using such a test function 
simply counts all normalized zeros in our family that are in [— r, r] (there is no weight- 
ing as <p is identically 1 in this interval). Thus the predicted average number of such 
zeros in this interval as R — > oo is 

^(y)%so(y)dy = J ${y) (s(y) + ~ + r^j dy 

r+^j 0(0) +0(0) 

= r + ^ + 2r. (3.14) 

□ 

3.3. Explicit upper and lower bounds. We conclude by determining the upper and 
lower bounds from Theorem 1 1.31 for the average number of normalized zeros in given 
intervals as R — > oo. 

We first consider the lower bound, which means we must maximize C(h) (as it is in 
the denominator for r, the larger C(h) the smaller the window). As the optimal choice 
of h (in a given class of functions) is only slightly better than similar h, we do not spend 
too much time on determining the truly best h. Consider the family of functions given 
by 

h n {x) = (I- x 2 )(l + a 2 x 2 + --- + a 2i x 2i + ■ ■ ■ + a 2n x 2n ) . (3.15) 

We set ao = 1 as the maximum is to occur at x = 0, and since the ratio is invariant 
under rescaling the a/s, we might as well take a — 1. Note that each a 2i+1 = as our 
function is even. We chose h n of this form as this forces h n to be even and to vanish at 
±1. We have 

, / h n (u) 2 du \ 

C(h n ) = /° V ; . (3.16) 

V Jo K{u)K{u)du J 

The optimum value of the square-root appears to be 2/n. For example, when n = 2 we 
must compute 



max 



I 16a2 I 8a 2 _|_ 16Q4 _i_ 16a2«4 _i_ u "4 

15 105 315 315 693 1287 



2 \ 1/2 



(i_t.ii i \ _4 8a2 44a^ 804 80204 52a| 

3 15 105 35 15 231 

/ 6006 + 286a2 + 572a 4 + 70a? + 52a 2 (33 + 5a 4 ) \ 1/2 

= max — ; t_ n ; rr \ (3.17) 

a 2 ,a 4 V39(385 + 121al + 66a 4 + 65al + 154a 2 (l + a 4 ))/ 

this is the quantity inside the square-root, not C(h). Using Mathematica we find the 
optimal values are a 2 w -.233428 and a 4 w .0189588, which leads to C(h) w 0.63662; 
as 2/tt 0.63662, this suggests the optimal value of C(h) might be 2/rr. This yields 
the window (— ^) in which we have on average (as R — >• oo) r + | zeros. 
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Remark 3.4. As we expect the true answer to be a window of size (i.e., we expect to 
be able to take a = oo ), it is not worthwhile to find the true optimum above merely to 
save a bit in a few decimal places. The purpose of this analysis is to show that we do 
see the correct number of zeros on average in the limit in a window of size proportional 
to 1/ a; the actual value of the proportionality constant, while interesting, is in some 
sense immaterial as we believe the density conjecture holds for arbitrary a. 

We list some approximate values for C(h) for other obvious candidates, which are 
all less than the 0. 63662 ( which is approximately 2 /it) found above. 

• h(x) = (1 — x 2 ) 2 has C(h) m 0.57735 (with the quantity inside the square-root 
looking like 1/3); if we take just (1 - x 2 ) we get C(h) = a/2/5 « 0.632456. 

• h(x) = exp(-l/(l - x 2 )) has C(h) « 0.570024. 

• h(x) = exp(- 754212/(1 - x 2 )) has C(h) » 0.575629 (the value of .754212 
was obtained by searching for optimal test functions among exp(— a/(l — x 2 ))). 

We now turn to finding explicit upper bounds for the average number of normalized 
zeros in [— r, r] as R — > oo. We continue to analyze the candidate function ij){x) = 

( ~~~^ J (see Figure [2] for a plot). We have freedom in terms of how we rate our 
approximation; for example, we can decrease the upper bound if we simultaneously 
decrease the size of the interval. 

A natural value to take for the size of our interval is the optimal interval found in the 
lower bound analysis, namely set r « l/2er. As 



(i-S) if|vl<* 



iP(y) = <! CT V "J (3.18) 

otherwise, 



we have ^(0) = l/cr, ^(0) = 1 and V(l/2<r) = 4/vr 2 w 0.810569 (or if we believe the 
approximations, ip(r) = ^ sin 2 (2/7r)). Thus after some algebra we see that the average 
number of normalized zeros in the interval (— is at most ^ (r + ^ + -). 

Appendix A. Standard Conjectures 
At various points in the paper we assume the following conjectures. 

Generalized Riemann Hypothesis (for Elliptic Curves). Let A(s, E) be the com- 
pleted, normalized L-function of an elliptic curve E with function equation s — > 1 — s. 
The non-trivial zeros p o/A(s, E) have Re{p) = 1/2. 

Birch and Swinnerton-Dyer Conjecture HBS-D1[|BS-D2L Let E be an elliptic curve 
of geometric rank r over Q with Mordell-Weil group E(Q) = 711 © T. Then the analytic 
rank ( the order of vanishing of the completed L-function at the critical point) equals 
the geometric rank. 

Tate's Conjecture for Elliptic Surfaces [ITal . Let £/Q be an elliptic surface and 
L 2 (£, s) be the L-series attached to H^(£/Q, Qi). L 2 (S, s) has a meromorphic con- 
tinuation to C and —ord s= iL 2 (S, s) = rank NS(£/Q), where NS(£/Q) is the Q- 
rational part of the Neron-Severi group of £. Further, L 2 (£, s) does not vanish on the 



TOWARDS AN AVERAGE' VERSION OF THE BIRCH AND SWINNERTON-DYER CONJECTURE 17 

line Re(s) = 1. 

Remark A.l. Tate's conjecture is known for rational elliptic surfaces. An elliptic sur- 
face £ : y 2 = x 3 + A(T)x + B{T) is rational if and only if one of the following 
is true: (1) < max{3 degA, 2 degS} < 12; (2) 3 degA = 2 deg£ = 12 and 
ord T=0 T 12 A(T- 1 ) = 0. See ffRSTl . pages 46 - AT for more details. 



ABC Conjecture. Fix e > 0. For co-prime positive integers a, b and c with c = a + b 
and N(a, b, c) = Y[ p{obc p, c < e N(a, b, c) 1+e . 

The full strength of ABC is never needed; rather, we need a consequence of ABC, 
the Square-Free Sieve Conjecture (see llGrll ): 

Square-Free Sieve Conjecture. Fix an irreducible polynomial fit) of degree at least 
4. As N — > oo, the number oft e [N, 2N] with f(t) divisible by p 2 for some p > log iV 
iso(N). 

For irreducible polynomials of degree at most 3, the above is known, complete with 
a better error than o(N) ( HHoll . chapter 4). 

We use the Square-Free Sieve to handle the variations in the conductors. If our eval- 
uation of the logarithm of the conductors is off by as little as a small constant, the 
prime sums become untractable. This is why many works normalize by the average 
log-conductor. 

The following conjecture is used only to interpret some of our results (unless we are 
calculating the 2-level density to distinguish the three orthogonal candidate groups). 

Restricted Sign Conjecture (for the Family T). Consider a one-parameter family 
T of elliptic curves. As N — » oo, the signs of the curves E t are equidistributed for 
t e [N, 2N]. 

The Restricted Sign conjecture can fail (there are families with constant j(E t ) where 
all curves have the same sign, as well as more exotic examples). Helfgott [He] has 
related the Restricted Sign conjecture to the Square-Free Sieve conjecture and standard 
conjectures on sums of Moebius: 

Polynomial Moebius. Let fit) be a non-constant polynomial such that no fixed square 
divides f (t) for all t. Then Ylt=N A t (/(^)) = °(-^0- 

The Polynomial Moebius conjecture is known for linear f(t). 

Helfgott shows the Square-Free Sieve and Polynomial Moebius imply the Restricted 
Sign conjecture for many families; this is also discussed in [Mill J . More precisely, let 
M(t) be the product of the irreducible polynomials dividing A(t) and not c^t). 
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Theorem: Equidistribution of Sign in a Family [HeJ: Let T be a one-parameter 
family with a«(t) G Z[i]. If j(E t ) and M[t) are non-constant, then the signs of E t , t G 
[N, 2N], are equidistributed as N — ► oo. Further, if we restrict to good t, t G [N, 2N] 
such that D{t) is good (usually square-free), the signs are still equidistributed in the 
limit. 
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